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Abstract 

Optimal pursuit -evasion problem between two aircraft including a realistic weapon envelope is analyzed 
using differential game theory. This study employs sixth order nonlinear point mass vehicle models 
and allows the inclusion of an arbitrary weapon envelope geometry. The performance index is a linear 
combination of flight time and the square of the vehicle acceleration. Closed form Solution to this high- 
order differential game is then obtained using feedback linearization. The solution is in the form of a 
feedback guidance law together with a quartic polynomial for time-to-go. Due to its modest computational 
requirements, this nonlinear guidance law useful for on-board real-time implementation. 


Introduction 

The objective of this paper is to develop an optimal nonlinear guidance scheme for aircraft pursuit- 
evasion using differential game theory [1] and the theory of feedback linearization [2,3]. This work extends 
previous research [4] to include time- of- flight in the performance index together with a realistic weapon 
envelope for the pursuing aircraft. Six state nonlinear point mass models of aircraft with lift, bank angle 
and throttle controls sue employed in this work. The weapon envelope considered is an arbitrary three 
dimensional manifold with its origin at the vehicle center of gravity. This manifold may be specified as 
a function of the angle between the line-of-sight vector and the vehicle velocity vector, see Figure 1 for 
details. The distance between the two aircraft is then redefined as the difference between the relative 
position vector and components of the weapon effectiveness range measured from the pursuer’s center of 
gravity. The pursuer attempts to drive this distance to zero while the evader attempts to make it as large 
as possible. The pursuit -evasion game terminates the first time instant this distance vector becomes zero. 
Both vehicles seek to accomplish their objectives in a time- optimal fashion while satisfying the limits on 
permissible acceleration levels. It is assumed in the present research that the evader has no offensive 
capabilities. As a result, this analysis includes only the pursuer’s weapon envelope. Note that alternate 
approaches [5-8] may be required in the case where both vehicles have offensive capabilities. Additionally, 
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unlike Reference 5, it is assumed here that the weapon envelope cannot be oriented independent of the 
pursuer’s velocity vector. 

In the past, differential games of this nature could only be handled by linearizing the vehicle dynamics 
[9, 10] or by the use of simplified models [7, 12]. Differential game solutions obtained using linearized 
vehicle dynamics is of dubious value since it is not reasonable to define a nominal trajectory. Use of 
simplified vehicle models, on the other hand, can result in optimistic or pessimistic results depending on 
the region of validity of these models. 

In practical applications, these factors have led to the formulation of such guidance problems as 
one-sided optimization problems, accurate only for dealing with nonmaneuvering targets. In order to 
handle actively maneuvering targets, it is necessary to formulate them as differential games. In this case, 
however, the numerical complexities preclude real-time implementation. Alternate approaches to these 
problems consist of off-line construction and storage of a field of extremals with real-time interpolation 
[ 12 ]. 

In Reference 4, it was shown that feedback solutions are feasible for a class of nonlinear differential 
games arising in aircraft pursuit-evasion. In that work, the performance index was required to be a 
quadratic form in the distance between two vehicles and the square of the magnitude of their acceleration. 
In [4], the nonlinear aircraft models were first transformed into a linear time-invariant form. A differential 
game was then formulated in transformed coordinates and solved. The resulting guidance law was 
transformed back to the original space to obtain the nonlinear pursuit-evasion guidance law. This research 
has been subsequently extended to study spacecraft pursuit-evasion and rendezvous problems [13, 14]. 
Solution to the differential game in this form requires the knowledge of time-to-go. In vehicle models 
without actuator saturation, this quantity can be computed exactly as outlined in Reference 13. Recently, 
the solution given in Reference 4 has been evaluated on a realistic simulation of high performance aircraft 

The objective of the present research is to extend this methodology to include time-of-flight in the 
performance index and a realistic weapon envelope in the model for the pursuing aircraft. Note that 
the results presented here may be adapted to one-sided guidance problems such as those discussed in 
References 16 and 17. 

Nonlinear Models for the Pursuer and the Evader 

The point-mass equations of motion for an aircraft are given by 


T(h,M,ri)- D(h,M,L) 

— — — — g sin 7 

m 

(i) 

# L $in<f> 

Y = 

mV cos 7 

(2) 

g .L cos<f> 

7 = v ( w COS7) 

(3) 

x = V cos 7 cosx 

(4) 

y — V cos 7 sin x 

(5) 

h = V sin 7 

(6) 
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The salient assumptions in this model are a flat non-rotating earth, thrust along the path and a quiescent 
atmosphere. In the equations (l)-(6), V is the airspeed, 7 the flight path angle, X heading angle, T the 
vehicle thrust, D vehicle drag, L the lift, g acceleration due to gravity, M the Mach number, and m is 
the vehicle mass. Position of the aircraft in an earth-fixed, inertial frame is given by the down-range x, 
cross-range 3/, and altitude h. The control variables in this model are the vehicle lift X, bank angle <£, 
and the throttle setting 77. Note that the vehicle thrust is specified as a nonlinear function of altitude, 
Mach number and the throttle setting. 

The aerodynamic drag is calculated using the drag coefficient Cp, the airspeed V, the atmosphere 
density p and the reference area s as 

D = C D spV 2 / 2 (7) 

In (7), the drag coefficient Co is a nonlinear function of the Mach number M and the lift X. 

The equations of motion for the evader is in the same form as that of the pursuer. However, the 
thrust and drag characteristics may be different. 

The Pursuer’s Weapon Envelope 

The present analysis will include a weapon envelope only for the pursuer. The evader is not assumed to 
have any offensive capabilities. A more complex formulation will be essential if one assumes the existence 
of offensive capabilities for the evader. In such a differential game, each participant may attempt to 
maxim ize the distance between itself and the other vehicle’s weapon envelope, while attempting to drive 
the adversary into its own weapon envelope. Such a formulation may lead to the study of combat games 
or two-target games [5-8]. The present research will not address these issues. In all that follows, it will 
be assumed that the roles of each participant in the game is fixed and remains unchanged for the entire 
duration of the engagement. 

As indicated in the foregoing, the evader is not assumed to have any offensive capabilities. As a 
result, if the pursuer is successful in bringing the evader within its weapon effectiveness range, then 
capture is said to have occurred. In the present research, pursuer’s weapon envelope is assumed to 
be a three dimensioned manifold with its origin located at the vehicle center of gravity. The weapon 
effectiveness range is defined as the distance between the vehicle center of gravity and the intersection of 
the three dimensional manifold defining the weapon envelope with the line-of- sight vector. Details are 
shown in Figure 1. The weapon effectiveness range is assumed to be a function of the angle between the 
vehicle velocity vector and the line-of-sight vector. Note that this modeling is consistent with the weapon 
usage envelope in currently operational fixed wing fighter aircraft. 

It is assumed here that the weapon envelope cannot be oriented independent of the vehicle velocity 
vector. In flight vehicles such as combat helicopters and fighter aircraft with precision fuselage pointing 
capabilities, an alternate assumption might be more appropriate. Clearly, such capabilities provide 
additional degrees of freedom in controlling the outcome of the differential game. 

In a chosen inertial frame, if the pursuer’s velocity vector and the differential position vector between 
the pursuer and the evader are \x p y p h p ] T and [Ax Ay Ah] T respectively, then the angle between the 
pursuer’s velocity vector and the line-of-sight vector may be computed using the expression: 


cos 8 — 


I 'm p — w 

/A* 2 + Ay 2 + A/i 2 ^ 2 + y 2 + A 2 


In the expression (8), A/i = h e - h p , Ax — x t - x p , Ay = y e — y p . The angle S is sometimes termed 
as the line-of-sight angle. The weapon effectiveness range R w can then be specified as a function of the 
line-of-sight angle as : 
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Ry, = F{6) ( 9 ) 

For example, if the weapon, effectiveness envelope were a cone centered at the vehicle center of gravity, 
the weapon range may be described as follows: 

Rm — r , if \6\<6 max (10) 


Ry, — 0, otherwise (11) 

The quantity r is a specified constant. A prolate spheroid with its major axis oriented along the vehicle 
velocity vector appears to be a more realistic weapon envelope shape. Note that in this case, the weapons 
will be effective to a certain degree in the tail aspect also. This is consistent with the existing tactical 
weapon effectiveness envelopes in operational fighter aircraft. Such a weapon envelope is illustrated in 
Figure 1. In this case, it is possible to write an explicit expression weapon effectiveness range as 


Ryj — 


( 12 ) 


l + B cos 6 

In this expression, A and B are two constants specifying the size and shape of the prolate spheroid. 
These constants may be related to the minimum and maximum weapon ranges Rmin, Rmax as: 


A = 


2 Urn in ^ma 
Rmin + Rm a 


(13) 


B = 


Rmin Rm ax 

Rm m “f* Rmax 


(14) 


Note that the present specification of the weapon envelope can include a kill probability distribution given 
as a function of the weapon range. Further, any alternate shape for the weapon effectiveness envelope 
can be included in the ensuing analysis. If the orientation of the line-of-sight vector in the given inertial 
frame is defined using two angles 0 and y such that 


0 = tan 1 


A h 

i/Ax 2 + Ay 2 


(15) 


/x = tan (16) 

Ax 

the components of the weapon effectiveness range can be resolved into three components in the earth 
fixed frame as 


hy, = R w sin 0 

(17) 

x w = Ry, cos 0 cos p 

(18) 

y w = Ry, COS 0 sin p 

(19) 


Next, these quantities may be used to redefine the three components of the relative position between 
the pursuer and evader as : 

zi - x p + ex w - x t (20) 
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z 2 = y P 4- ey w - Vt 


( 21 ) 


zz = hp + ehw ~ he ( 22 ) 

The variable e multiplying the weapon envelope components is included in the foregoing to enable the 
adjustment of the relative weighting between the weapon envelope and the distance between the two 
vehicles. Such a trade-off is useful while considering the use of different weapon systems in a given 
pursuit- evasion scenario. From equations (20)-(22), it may be observed that the distance between the 
two vehicles can be altered either by changing the distance between the vehicle center of gravities or 
by orienting the weapon envelope. Both these quantities are directly influenced by the vehicle relative 
position, velocity, and acceleration components. Note that if the weapon envelope was independently 
orientable, then this relative distance will depend additionally on the orientation parameters. However, 
in all that follows, it will be assumed that the weapon envelope cannot be oriented independent of the 
vehicle velocity vector. 

Next assume that the quantities t/ w , h w and their various time derivatives are available from 
on-board measurements. Examining the geometric relations (19)-(21), it may be observed that these 
quantities can be computed if the angles <5, 0, and their various time derivatives are available. The 
assumption that these quantities are available from measurements is crucial for including an arbitrary 
shaped weapon envelope in the derivation of the present guidance law. If these quantities are not available 
from on-board measurements, then only a spherical weapon envelope shape can be employed in the 
analysis. This is because of the fact that the second derivative of the line-of-sight angle depends on 
jerk. The presence of this term introduces difficulties in transforming the vehicle model into linear, 
time-invariant form as will be apparent in the next section. 

An interesting extension of the formulation discussed in this section is the inclusion of a sensor 
effectiveness envelope in the analysis. In such a pursuit-evasion game, the maneuvers will not only be 
influenced by weapon envelope considerations, but also by information trade-offs. A familiar example of 
the control-information mini- maximization is the timed maximum lateral acceleration maneuver employed 
by fighter aircraft for missile evasion. The objective here is to take advantage of the fact that missile 
seeker tracking rate as well as the missile maneuvering capabilities are limited. Following the proposed 
methodology for the inclusion of the weapon envelope, it is possible to include a sensor effectiveness 
envelope in the present formulation. Additional situations where such constraints as these arise include 
the guidance of robots in the presence of workspace envelope and actuator constraints. These issues will 
not be pursued any further in the present paper. 


Feedback linearization 

The chief difficulty in obtaining solutions to differential games using realistic flight vehicle models 
is that these models are highly nonlinear. Moreover, classical linearization approach using Taylor series 
expansion is invalid in a differential game setting [7], primarily due to the difficulty in defining a nominal 
trajectory. However, it can be shown[4] that if the differential game between the two vehicles is formu- 
lated in terms of the position state variables and their various derivatives, then feedback linearization 
approach can be used to obtain a nonlinear feedback solution. The primary thrust of the present re- 
search is to extend the work in Reference 4 to include the time-of- flight in the performance index and 
a weapon envelope in the vehicle model. Feedback linearization is then used to transform the vehicle 
dynamic models into a linear, time-invariant form. The pursuit-evasion game is formulated in terms of 
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the transformed states and solved to obtain the guidance law. Inverse transformation of this guidance 
law to the original coordinates produces an implementable nonlinear guidance scheme. 

As in Reference 4, feedback linearization is accomplished by differentiating the equations (20)-(22) 
twice with respect to time and substituting for F,7,x from expressions (l)-(3). interpreting the right 
hand sides of the resulting expressions as the new control variables in the problem, one has : 

11 = Z u Z x = Ui + eVi - W x (23) 

1 2 = Z 2 , Z 2 = U 2 + eV 2 - W 2 (24) 


i 3 — Zzi Zz = Uz + CV3 — Wz 


where 


_ (T. - fl>) 

m p 

(T — . D ) L 

U 2 = — — cos 7p sin Xp H — (cos x P sin <f> p - sin 7 P sin x P cos <f> p ) 


U\ = — — cos 7 P cos Xp £ (sin7 P cos x P cos <p p + sin x P sin 4> p ) 

Trip trip 


Trip Trip 

tt ^ p ± 

U$ - — sm7p + — cos 7 P cos <p p - g 

TXlp TTlp 

Vi = x w , V 2 = y w , Vz — ft® 


(25) 

(26) 

(27) 

(28) 
(29) 


(T — D 1 L 

W\ - — — COS 7 e COS x« -(sin7e cos Xe cos <f> e + sin Xe sin 4> e ) 

m e Tn e 

(T - n ) jj 

W 2 = — — cos 7e sin Xe 4 -(cos x« sin <f> e - sin 7 e sin x« cos </>«.) 


m. 


m. 


(30) 

(31) 


jxr (^e — -^e) • ft • 

W3 = i sm 7« H cos 7e cos <p e — g (32) 

m e m e 

Note that the new control variables Ui, Wi, i = 1, 2, 3 depend on the system states and the original control 
variables. If the parameter e is set to zero, the model (23)-(25) will turn out to be identical to that given 
in Reference 4. It is assumed here that the various derivatives of the weapon envelope components 
x w , y w , h w are available from on-board measurements. In the general case, these derivatives will depend 
on the vehicle position, velocity, acceleration, jerk and various time derivatives of jerk. As a result, if 
these quantities are not available as measurements, the feedback linearization implied by the expressions 
(23)-(25) will not be feasible. In that case, the ensuing analysis will permit only the use of a spherical 
weapon envelope. 

With the interpretation of U\ + eVi, U 2 + eV 2 , U 3 + eVz,Wi,W 2 ,W 3 as the new control variables, the 
equations (23)-(25) describe a linear time-invariant system. Given the pseudo-control variables, the real 
control variables in the system can be computed using the relations [4]: 


<f> p — tan 1 


' Uz cos Xp — Ui sin x P 

.cos 7 p (( 7 3 + g) - sin7 P (C/i cosx P + U 2 sin x P ). 


(33) 
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( 34 ) 


_ m p [cos7 P (^3 + 9) — sin 7 € {U\ cosx P + U 2 sinx P )] 
Lp ~ cos <pp 


Tp — 


sin7 P (I73 + 9 ) + cos7 P (I/i cosx P + U 2 sinx P ) rn p + D p 


(35) 


The corresponding expressions for the evader’s control variables can be obtained by replacing U\, U 2 , U 3 
by W\, W 2 , W 3 and changing the state variable subscripts as follows : 


<f> e = tan 

L e = 




W 2 cos Xe - W\ sin x« 


COS 7 e(W 3 + g) - sin7e(W 1 COSXe + sinXe)J 
m e [cos7 e (W 3 + 9) ~ 8in7«(Wi cosXe + ^2 sinx e )] 

COS <f>e 


(36) 

(37) 


T,= 


sin 7 e(W 3 + g) + COS7 e (Wi COSXe + W2 sinXe ) ^e + Be 


(38) 


Physically, the pseudo- control variables Uu t = 1,2,3 are the acceleration components of the vehicle 
in the chosen inertial frame. Since the magnitude of a vector is invariant under coordinate transformation, 
the magnitudes of Ef, W are also the pursuer-evader acceleration magnitudes in the flight path axis system. 

The attitude dynamics of the pursuer and the evader were not included in the foregoing analysis. 
Note that these may be included at the expense of increased model complexity. The primary reason for 
including these in a differential game would be to study the various information trade-offs involved in a 
typical pursuit- evasion scenario. For instance, if the pursuer’s weapons employed an active radar seeker, 
then its maneuvers would be influenced by the fact that the maximum target area must be visible at all 
times. The evader, on the other hand, would employ an opposite strategy. Several interesting variations 
of this differential game can be studied in the present setting. 


Guidance Law for Aircraft Pursuit-Evasion 

The previous section dealt with an approach for making the nonlinear aircraft models amenable 
to analysis. In this section, the pursuit -evasion differential game will be formulated in the transformed 
coordinates and solved. Inverse transformation of this solution to the original coordinates yields the 
nonlinear feedback guidance law. In the following development it will be assumed that all the state 
variables req uir ed for computing the feedback law are known perfectly. Once the differential game is 
solved with perfect information, the effects of incomplete or imperfect information can be investigated 
using this solution. 

The first issue in differential games is that of role definition. There is a controversy on this issue 
currently. But if the roles sue assigned at the outset, the resulting differential game is amenable to 
analysis via Isaac’s theory [ 1 ], Assuming that the roles have been defined, the objective of the pursuer is 
to minimize the specified performance index which the evader tries to maximize. The performance index 
employed in the present research is: 

min max [ * (c + — - — (a[(C/i + eFi ) 2 + ( U 2 + eVi) 2 + (C /3 + cVs) 2 ] 

(U+tV) w Jo P 2 L 

-b[W! 2 + W 2 2 + W 3 ^dt (39) 
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The final time tj is unspecified. This performance index is to be optimized by the two participants 
subject to the differential constraints (23)-(25). In (39), C defines the relative weighting between flight 
time and acceleration magnitude, while the positive quantities a and b serve to constrain the acceleration 
magnitudes of the pursuer and the evader. For reasons that will be made clear in the ensuing, it is 
assumed that 


b > a, 0<C<1 ( 4 °) 

The negative sign in front of the evader’s acceleration term explicitly identifies this player as the 
maximizer. Ini tial values on all the state variables are assumed known. The pursuit-evasion maneuvers 
are terminated the first time instant the evader makes contact with the pursuer’s weapon envelope, viz. 

*1 /, Z2f, z 3f = 0 (41) 

where 


z lf — Z 1 (*/)> z 2/ = z 2(*/)> z 3f = z 3 (*/) 

The final value of the relative velocities Z x , Z 2 , Z 3 are assumed to be free. Next, define the variational 
Hamiltonian as: 


H = C + |(1 - C){«[(tfi + *Vi) 2 + ( U 2 + cVi) 2 + ( U 3 + eVi) 2 ] - b[W x 2 + W 2 2 + W 3 2 ]} 

+A X Z X + A 2 Z 2 + A3Z3 + A 4 (Cfi + eVi - W x ) + A 5 (d 2 + eV 2 - W 2 ) + X 6 (U 3 + eV 3 - W 3 ) (42) 

As in Reference 4, the objective of the present research is obtain a saddle-point solution to the 
differential game. The conditions under which such a solution may exist are well known [18]. The central 
requirement here is the separability of the variational Hamiltonian with respect to the pursuer and the 
evader state and control variables. In the present case, inspection of the variational Hamiltonian defined 
in (38) will reveal that such a separability exists. The saddle point solution can be found by proceeding 
formally as follows. 

The costate equations [10] for this problem can be obtained as 

= X 2 = A 3 = 0 (43) 

A4 = — Ai, Ag — — A2, Ae = —A3 (44) 

The optimality conditions yield : 

Ui + «Vi = — (45) 
a(l -C) 

Ut + eVl = 0 ( 1^0 (46) 

t ' 3+ ^ = ^o (47) 
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(48) 


Wi = 

W 2 = 

w z = 


-A 4 

6(1 - C) 

-As 

6(1 - C) 

-A« 


(49) 


(50) 


6(1-C) 

Since the final values of Z\, Z 2 , Z z are free, the costates A4, Ag, Ag are zero at the final time. This 
fact, together with equations (45)-(50) imply that the pseudo control variables are all zero at the final 
time. Integrating the costate equations (44) and using the boundary conditions on A4, Ag,Ag yields 


A4 = Ai(t/ ~ 0 
Ag = A 2 (t/ — t) 


(51) 

(52) 


Ae = A z{tf — t) (53) 

Using the expressions (51) - (53) to eliminate the costates on the right hand side of the optimality 
conditions (45) - (50) one has : 



(54) 


(55) 


(56) 


Expressions (54)-(56) may next be integrated to obtain Z\, Z 2 , Z 3 . Due to the symmetry of the solution 
to this problem, only one component of the solution will be fully illustrated in the ensuing. Thus : 




Integrating the expression (57) yields 

Zi{t) = Zi(0) + Zi(0)t + 


A x r l l w t 2 t* 


(57) 


(58) 


(1-C) 6 a JW 2 6 

Using the game termination condition (41) in the expression (58) yields 

A i = -(l 1 _6) < ) / a 3 6 [ Zl( °) +Zl( °)^ 

This may next be substituted in the optimality conditions to obtain the optimal control for the 
pursuer as 


(59) 


Ui + eV 1 


36 

(a - b)t f s 


2i(0) + Z^tfUtf — t) 


(60) 
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(61) 


The optimal control for the evader is given by 

m = (a _ 3 ° (/ 5 [21(0) + z.(o )«/](</ - <) 

The remaining control variables in the problem can be similarly computed. The solution is incomplete 
at this stage since the final time t } is unknown. This quantity may be computed by invoking a constant 
of motion in this problem. Since the final time is open and the variational Hamiltonian is autonomous, 
one has that 


H(t) = 0 (62) 

Substituting for the optimal controls in terms of costates in the constant of motion (62) at the initial 
time results in 

0 = c+ 2(I^o i KM V + V + V } 

+AjZi( 0) + A2^2(0) + AaZ3(0) (63) 

Next, substituting for A 4 , A 5 , A« from expressions (51) - (53) yields 

0 = C + 2(1^0^ ” M A, ‘ + V + A3l }‘ ,! 

+AiZi(0) + A 2 Z 2 (0) + A 3 Z 3 (0) (64) 

Finally, substituting for Ai, A 2 , A 3 in terms of initial states from expression (59) yields a quartic polynomial 
of the form 


tf A + ? 2^/ 2 + gi if + go — 0 

(65) 

qz - [^ 1 ( 0 ) +^ 2 ( 0 ) + £ 3 ( 0 ) ]/ 3 ^ _ ^ a ] 

( 66 ) 

ft - 4[Z 1 (0)z 1 (0) + Z 2 (0)z 2 (0) + Z 3 (0)z 3 (0)]/ 3(1 C _ ^ J 

(67) 

go - 3[zx(0) 2 + z 2 (0) 2 + *3(0) 2 ]/ 3(1 5 c) [ 6 a ] 

(68) 


The smallest positive value of the final time emerging from the polynomial (65) should be used in 
subsequent calculations. Note that the parameter C should satisfy the in equality (40) to ensure that 
these polynomial coefficients remain finite. Additionally, the polynomial coefficients go and <72 will be 
negative if b > a. Since the coefficient corresponding to tf Z is zero and the coefficients go, <72 are 
negative, the Hurwitz criterion [19] in the theory of polynomials implies that this polynomial has roots 
with positive real parts. Next, forming the Routh array [19], the first column turns out to be 

[1, k, -qi/K, qu 9o] T ( 69 ) 

Here, k is a small positive parameter a nd the superscript T denotes the transpose. If gi is less than 
zero, this array suggests that one root of the polynomial (65) will have a positive real part. On the other 
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hand, if q x > 0, three roots of this polynomial will have positive real parts. Since complex roots always 
occur in conjugate pairs, the foregoing observations imply that the polynomial (65) will produce at least 
one usable value of t f . Note that the final time t/ has to be iteratively determined. However, since 
the objective is to determine the smallest positive real root of this polynomial, a one dimensional search 
scheme is adequate. 

An interesting special case occurs if 51 were zero. Note that the numerator of the coefficient 51 i® 
simply the inner product of relative position and velocity vectors at the initial time. If these two vectors 
are orthogonal, the coefficient q\ will be zero. This condition can be seen to be satisfied in various 
commonly encountered engagement scenarios, one of them being the case of pursuer and evader being 
instantaneously located at the same down-range- cross-range positions while in level flight at different 
altitudes and airspeeds. In these cases, the quartic (65) can be solved for in closed form. All the roots of 
(65) may he computed using the expression 


tj = (70) 

Both 52 and qo will be negative if 6 > a. In this case, the right hand side of (70) will be strictly real. 

Finally, in order to convert the control laws (56), (57) to explicit feedback form, assume that that 
the current time t is the initial time. In this case, the quantities (tf — t), tf may both be replaced 
by time-to-go t go . Additionally, as discussed elsewhere in this paper, the weapon envelope components 
V u V 2y V 3 are assumed to be available from measurements. Using these, one has 


36 r 

Ut = (a - b) t ,s r* + Zi H 

-eVi 

(71) 

36 r 

U * = (a - 6)(„.= r + Z: H 

-eV 2 

(72) 

36 r 

U 3 = . . 2\ z 3 + Zztgo 

(a - b)t go 2 L " J 

-eV 3 

(73) 

^ = ( . h + 

(74) 


(75) 

W * = (a - b)t,J [** + 

(76) 


This completes the solution of the differential game in the feedback linearized coordinates. However, 
this solution is usable only after transformation to the original coordinates. This transformation may 
be achieved by substituting the expressions (71)-(76) in the expressions (33)-(38) given in the previous 
section. However, in order to conserve space, this step will not be carried out here. 

The guidance laws resulting from the foregoing algebraic manipulations are highly nonlinear and 
coupled. They use full state feedback along with vehicle performance related quantities such as thrust, 
drag, and mass for generating the minimax optimal feedback control settings for the pursuer and the 
evader. It is not difficult to show that these solutions satisfy the Strengthened Legendre- Clebsch necessary 
condition if a > 0, 6 > 0. Investigation of additional second-order necessary conditions and a detailed 
examination of the saddle point properties of this solution will be of future interest. 
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Numerical Evaluation 


The nonlinear guidance law developed in this paper was implemented on a point-mass simulation 
of two aircraft. The vehicle data used for both aircraft is similar that of a high performance aircraft 
used in previous aircraft pursuit-evasion studies [4,15]. The equations of motion were integrated using a 
fourth-order Runge-Kutta method and the aerodynamic coefficients and the thrust limits were linearly 
interpolated from a stored table. The time-to-go quartic (65) was solved using the Newton’s method with 
zero as the initial guess. The derivatives of the weapon envelope components required in the guidance 
law computations were obtained using a second-order linear observer of the form 


Xttro — ^ uo) 2l/UJ£t£H> 

(77) 

Vwo = v 2 {yw - Vwo) - 2uvy wo 

(78) 

h^wo ~ ^ (^u; h wo ) 2 vu) H-yjo 

(79) 


In (77)-(79),u/, v are the observer natural frequency and the damping ratio, respectively. The values used 
in the present n um erical study are u) — 10, v = 1. This observer uses the weapon envelope components 
y wy h w as the inputs to form the derivative estimates i WO ) Vwoy x wo , y woy hwo. These estimates are 

then used in computing the controls for the pursuer and the evader. 

Although several runs have been made, results from one engagement scenario will only be presented 
in the ensuing. In this study, a highly eccentric prolate spheroid weapon envelope with its major axis 
aligned along the pursuer’s velocity vector was considered. This weapon envelope had a minimum range 
of 10 m and a maximum range of 500 m. The two vehicles are initially separated by 2000 m in down 
range and 5000 m in cross range, with the pursuer behind the evader. The pursuer’s velocity vector is 
initially aligned along the down range direction, while the evader’s velocity vector points along the cross 
range direction at the initial time. The pursuer has an initial velocity is 160 m/s while flying level at 16 
km altitude. The evader has an initial velocity of 110 m/s at an altitude of 10 km, with zero flight path 
angle. The present study employed a weighting factor of a = 0.01 for the pursuer and b = 0.03 for the 
evader. The weight on the flight time was £ = 0.5. The engagement time corresponding to these initial 
conditions and the given weights computed from the quartic (65) turns out to be 50.79 seconds. The 
weapon envelope weighting factor e = 1 was used in this analysis. 

Figure 2 illustrates the trajectories of both the pursuer and the evader in the cross-range down-range 
plot. Triangular markers are provided every 5 seconds to give an idea about the relative position of the 
two vehicles. To further aid in interpreting these trajectories, the flight time is indicated at 10 second 
intervals along the pursuer-evader trajectories. The turn-dash behavior of both the pursuer and the evader 
noted in previous studies [4, 12, 15] is apparent from this figure. The altitude histories corresponding 
to this engagement are given in Figure 3. Both the pursuer and the evader have negative flight path 
angles at the time of capture. The heading angle histories for both vehicles are given in Figure 4. From 
this figure, it is clear that the pursuer is attempting to continuously avoid a heading angle match. The 
airspeed histories for the two vehicles given in Figure 5 shows the pursuer slowing down during the initial 
portion of the turn followed by an acceleration. The evader, on the other hand, is accelerating through 
most of the engagement. The load factor, throttle setting and the bank angle for both the pursuer and 
the evader are shown in Figures 6, 7, 8. From these figures, it may be observed that both the pursuer and 
the evader are executing a descending turn. In order to illustrate the influence of the weapon envelope on 
the engagement, a plot of the acceleration history along the altitude direction is given in Figure 9. If the 
weapon envelope were spherical, the acceleration history would have been a straight line, as is the case 
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for the evader. However, the presence of the prolate spheroid envelope introduces strong nonlinearities 
in the acceleration history. The performance of the pursuit-evasion guidance law is apparent from these 
plots. Evaluation of these guidance laws in a more complex vehicle simulation is currently under way. 


Conclusions 

This paper presented the development of feedback guidance laws for aircraft pursuit-evasion. The 
analysis employed nonlinear point-mass models of aircraft. A realistic weapon effectiveness envelope was 
included in the analysis. Assuming that the weapon envelope components may be computed from given 
measurements, the vehicle model was transformed to a linear, time invariant form. The pursuit-evasion 
differential game was formulated using this model and the solution obtained. The performance index 
employed consisted of a linear combination of flight time and the square of the vehicle acceleration. 
Inverse transformation of this solution produces a nonlinear guidance law together with a quartic for 
the computation of the free final time. This guidance law is in closed-loop state feedback form and uses 
the vehicle performance data. Modifications of the present guidance law to include a sensor effectiveness 
envelope were sketched. 

Numerical results using high performance aircraft data were given. Since the computational re- 
quirements for the guidance law are modest, it appears that this solution is implementable on-board 
aircraft. 
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Fla 2 Pursuer-Evader Trajectories In the Horizontal Plane. 
Dark Triangle : Evader, Light Triangle : Pursuer. 
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Fla. 3. Temporal Evolution of Pursuer- Evader Altitudes. 
Dark Triangle : Evader, Light Triangle : Pursuer. 


17 




AIRSPEED, m/S 



8 





LOAD FACTOR 



TIME, S 


Fig. 6. Load Factor Histories for the Pursuer and the Evader. 
Dotted Line : Evader, Solid Line : Pursuer. 
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Fig. 7. Temporal Evolution of Pursuer-Evader Throttle Setting. 
Dotted Line : Evader, Solid Line : Pursuer. 
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Fig. 8. Bank Angle Histories for the Pursuer and the Evader. 
Dotted Line : Evader, Solid Line : Pursuer. 
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Fig. 9. Altitude Acceleration Histories for the Pursuer and the Evader. 
Dotted Line : Evader, Solid Line : Pursuer. 
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